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Claims 1-78 lack an inventive step under PCT Article 33(3) as being obvious over Krane (US 5,799063 A). Krane teaches a 
conversational browser (Talk Web browser), comprising: a command and control interface for interpreting a user command (voice 
response unit, VRU) (col. 6, lines 20-27) and for generating a request to access one of a CML file (Talk Web page) and a CML 
application (control file / TWC files), wherein CML implements a conversational dialog for interaction with the user (col. 7, lines 1- 
40); and, a CML processor (speech recognition platform / VRU / index server / audio player) for parsing and interpreting one of a 
CML file and CML application to render the conversational dialog (col. 3, line 66-coI. 4, line 10; col. 5, lines 5-10). It would be 
obvious that since that web pages have meta-information stored within itself, that a Talk Web page would also have meta- 
information. 
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WHAT IS CLAIMED IS : 

1. A conversational browser, comprising: 

a command and control interface for interpreting a user command and for 
generating a request to access one of a CML (conversational markup language) file 
and a CML application, wherein CML comprises meta-information implementing a 
conversational dialog for interaction with the user; and 

a CML processor for parsing and interpreting one of a CML file and CML application to 
to render the conversational dialog. 

2. The conversational browser of claim 1, wherein the conversational browser 
executes on top of a platform that processes input/output (r/O) events under the control of the 
conversational browser to provide a conversational user interface in accordance with the 
rendered conversational dialog. 

3. The conversational browser of claim 2, wherein the platform is a conversational 
virtual machine. 



4. The conversational browser of claim 1, further comprising a dialog manager for 
managing and controlling the conversational dialog. 

5. The conversational browser of claim 4, wherein the dialog manager allocates 
conversational engines and conversational data files for rendering the conversational dialog 
comprising one of the CML file and the CML application and for processing input user 
commands. , 

6. The conversational browser of claim 5, wherein the dialog manager is one of 
controlled and programmed by meta-information comprising one of a CML file and CML 
application. 
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7. The conversational browser of claim 5, wherein the conversational engines 
comprise a speech recognition engine, a speaker recognition engine, a text-to-speech engine, a 
natural language understanding engine, a natural language generation engine, a dual tone 
multifrequency engine, a speech compression engine, and a combination thereof. 

8. The conversational browser of claim 1, where the input command comprise 
multi-modal input. 

9. The conversational browser of claim 1, wherein CML is implemented using a 
declarative format. 

10. The conversational browser of claim 1 , wherein CML encapsulates multi-modal 

dialog. 

1 1 . The conversational browser of claim 9, wherein the declarative format comprises 
an extensible markup language (XML) format. 

12. The conversational browser of claim 1, wherein CML is implemented using a 
combination of declarati ve and imperative components 

13. The conversational browser of claim 1 , wherein the conversational browser 
implements a "what you hear is what you can say" dialog format. 

14. The conversational browser of claim 1 , wherein the conversational browser 
implements a "say what you heard" dialog format. 

15. The conversational browser of claim 1 , wherein the conversational browser 
implements a "say what you will hear" dialog format. 

16. The conversational browser of claim 1 , wherein the conversational browser 
implements a mixed initiative dialog format. 
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17. A system for accessing information, comprising: 

a content server comprising one of content pages, applications, and a combination 
thereof, wherein the content pages and applications are implemented using a conversational 
markup language (CML) to describe a conversational dialog for interaction with a user; 

a conversational browser for processing one of a CML page and CML application 
received from the content server to render its conversational dialog; and 

a platform for processing input/output (I/O) events under the control of the 
conversational browser to provide a conversational user interface in accordance with the 
rendered conversational dialog. 

1 8. The system of claim 1 7, wherein the system comprises a IVR (interactive voice 
response) system, wherein a DJTMF application associated with the IVR system is implemented 
in CML. 

19. The system of claim 1 8, wherein the content server comprises the IVR system, 
and wherein the IVR system is accessible by the conversational browser over a packet-switched 
network using a standard network protocol. 

20. The system ofclaim IS, wherein the conversational browser and content server 
comprise the IVR system, and wherein the IVR system is accessible over a packet switched 
network using standard network protocol. 

21. The system of claim 18, wherein the IVR system comprises a gateway for 
capturing and compressing speech, and shipping the compressed speech to the content server 
comprising the IVR system for processing. 

22. The system of claim 18, wherein the input modality of the IVR system comprises 
one of speech, DTMF, and both, and wherein the output modality comprises speech. 

23. The system ofclaim 17, wherein CML is implemented using a declarative format. 
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24. The system of claim 1 7, wherein CML encapsulates multi-modal dialog. 

*• - 

25. The system of claim 23, wherein the declarative format comprises an extensible 
markup language (XML) format. 

26. The system of claim 17, wherein CML is implemented using a combination of 
declarative and imperative components. 

27. The system of claim 1 7, wherein the I/O events are multi-modal. 

28. The system of claim 1 7, wherein the I/O events are speech only. 

29. The system of claim 1 7, wherein the I/O events comprise one of synthesized 
speech output, playback of audio files, recognition of spoken input, recognition of DTMF (dual 
tone multifrequency) input, capturing of spoken input, capturing of text input, and a combination 
thereof. 

30. The system of claim 17, wherein the conversational browser one of comprises a 
CVM (conversational virtual machine) and executes on top of a CVM. 

3 1 . The system of claim 1 7, wherein the conversational browser and platform 
comprise an embedded architecture that operates in a pervasive computing client device. 

32. The system of claim 1 7, wherein the content server, conversational browser and 
platform one of execute on a single machine and are distributed over a communications network. 

33. The system of claim 17, wherein standard networking protocols arc utilized for 
accessing CML content pages and applications from the content server. 

34. The system of claim 33, wherein the standard networking protocols comprise one 
of TCP/IP (transmission control protocol/Internet protocol), HTTP (hypertext transmission 
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protocol), WAP (wireless application protocol), VoIP (voice over internet protocol), distributed 
network protocols, ad hoc network protocols, hopping network protocols, and a combination 
thereof. 

35. The system of claim 17, further comprising a transcoder for converting legacy 
information formats of the content server into CML based on predefined transcoding rules. 

36. The system of claim 35, wherein the transcoder interprets backend logic of the 
content server to generate a corresponding conversational dialog structure. 

37. The system of claim 35, wherein the transcoder executes on one of the 
conversational browser, a conversational proxy server, the content server, and distributed among 
combination thereof. 

38. The system of claim 35, wherein the conversational browser comprises a 
registration mechanism for registering one of its conversational capabilities, conversational state, 
and both, with one of the content server, transcoder and both. 

39. The system of claim 18, wherein the registration mechanism comprises one of a 
cookie, a form, procedural protocols, and a combination thereof. 

40. The system of claim 38, wherein the transcoder generates one of a custom CML 
file and CML application based on the conversational capabilities of the conversational browser. 

41. The system of claim 17, wherein the CML comprises a plurality of 
capability-based frames, each providing different dialog levels based on the conversational 
capabilities of the conversational browser. 

42. The system of claim 17, wherein CML comprises an active link that is one of 
spoken to and hidden from the user. 

-61- 
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43. The system of claim 17, wherein CML comprises a link to conversational data 
files Tor processing user input. 

44. The system of claim 1 7, wherein fcML comprises a link to at least one distributed 
conversational engine for processing user input. 

45. The system of claim 17, wherein CML comprises a link to an audio file for 
playback to a user. 

46. The system of claim 17, wherein CML comprises a confirmation request tag for 
confirming user input. 

47. The system of claim 17, wherein CML comprises TTS (text-to-speech) markup 
for specifying audio playback of text. 

48. The system of claim 1 7, wherein CML comprises one of scripting language and 
imperative code for executing conversational applications that direct their own dialog. 

49. The system of claim 17, wherein CML comprises a link to one of a plug-in, an 
applet, a dialog object, and a combination thereof, for executing a conversational task. 



50. The system of claim 17, wherein CML comprises one of (1) a top level element 
that groups other CML elements; (2) an element that specifies output to be spoken to the user (3) 
a menu element for encapsulating a menu that presents the user with a list of choices, wherein 
each choice is associated with a target address identifying a CML element to visit if the 
corresponding choice is selected; (4) a form element for encapsulating a form that allows the 
user to input at least on item of information and transmit the at least one item of information to 
a target address; and (5) a combination thereof. 
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5 1 . The system of claim 50, wherein a target address comprises one of a URL 
(uniform resource locator), a relative URL, a socket address, and a protocol identifier that 
identifies a desired protocol for establishing communication. 

52. The system of claim 51, wherein the target address is used for accessing one of a 
local and distributed conversational service. 

53. The system of claim 5 1 , wherein the target address is used for accessing one of a 
local and distributed conversational data file. 



54. The system of claim 50, wherein the form comprises input fields, and wherein a 
grammar is associated with each input field to specify appropriate responses for filling the 
corresponding input field. 

55. The system of claim 54, wherein each choice of the menu is associated with a 
grammar. 

56. The system of claim 55, wherein the menu and form comprise an attribute that 
specifies when their corresponding grammars are active. 

. 57. The system of claim 50, wherein the conversational browser employs a mixed 
initiative dialog feature. 

58. A method for accessing information, comprising the steps of: 

(a) processing an input command with at least one of a plurality of conversational 
engines; 

(b) generating a request based on the processed input command to access a CML 
(conversational markup language) file from a content server, the CML file comprising 
meta-information to implement a conversational dialog; 

(c) transmitting the request and accessing the requested CML file from a content server 
using a standard networking protocol; and 
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(d) processing the meta-information comprising the CML file to render the 
conversational dialog. 



59. The method of claim 58, wherein steps (b), (c) and (d) are executed by a 
conversational browser. 

60. The method of claim 59, wherein the conversational browser executes on top of 
an operating platform of a pervasive computing device. 

61 . The method of claim 58, wherein steps (a) and (d) are one of executed locally 
using a conversational engine and conversational arguments, executed remotely using 
distributed conversational engines and conversational arguments, and a combination thereof. 

62. The method of claim 59, further comprising the step of: 

registering one of conversational capabilities, conversational state, and both of the 
conversational browser with the content server. 

63. The method of claim 62, further comprising the step of customizing the CML file 
based on the conversational capabilities of the conversational browser.. 

64. The method of claim 58, further comprising the step of transcoding legacy 
content of the content server into CML based on predefined transcoding rules. 

65. The method of claim 64, wherein the step of transcoding is performed by one of a 
conversational browser, a transcoding proxy associated with a content server from which the 



legacy content is accessed, and a combination thereof. 

66. The method of claim 64, further comprising the step of processing backend logic 
of the content server to generate a corresponding conversational dialog structure. 
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67. The method of claim 58, wherein step (d) comprises the step of playing back an 
aucfio file, generating synthesized speech output, and a combination thereof. 

68. The method of claim 58, wherein CML is implemented using a declarative 

format. 

5 69. The method of claim 68, wherein the declarative format is based on XML 

(extensible markup language). 

The method of claim 58, wherein CML comprises multi-modal dialog. 

The method of claim 58, wherein CML comprises declarative and procedural 

10 72. The method of claim 58, wherein CML comprises one of (1) a top level element 

that groups other CML elements; (2) an element that specifies output to be spoken to the user (3) 
a menu element for encapsulating a menu that presents the user with a list of choices, wherein 
each choice is associated with a target address identifying a CML element to visit if the 
corresponding choice is selected; (4) a form element for encapsulating a form that allows the 

15 user to input at least on item of information and transmit the at least one item of information to 

. ^ a target address; and (5) a combination thereof. 

73. The method of claim 72, wherein a target address comprises one of a URL 
(uniform resource locator), a relative URL, a socket address, and a protocol identifier that 
identifies a desired protocol for establishing communication. 

20 74. The method of claim 73, wherein step (d) comprises the step of accessing one of 

a local and distributed conversational service associated with the target address. 



70. 
71. 

components. 
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75. The system of claim 72, wherein step (d) comprises the steps of: 

audibly outputting an introductory message of a form to a user, if a form element is 

visited; 

activating a grammar for each input field of the form, wherein the grammar specifies at 
5 least one appropriate value for the corresponding input field; and 

filling a field if the user speaks an appropriate response for the field. 

76. The method of claim 75, further comprising the steps of: 

audibly outputting an introductory message of a menu to a user, if a menu element is 

visited; 

10 audibly outputting prompt text of choice elements associated with the menu; 

activating a grammar for each choice element, wherein the grammar specifies a response 
for selecting the given choice element; and 

selecting a choice element if the spoken response of the user matches the grammar 
. associated with the choice element. 



15 



77. The method of claim 76, further comprising the step of activating the grammars 
of the menu and form for a predefined duration based on an attribute value specified by the menu 
and form elements. 



") 78. The method of claim 72, further comprising the step of employing a mixed 

20 initiative dialog feature for one of filling form input fields and selecting menu choices. 
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